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Abstract 

o 

q ' A novel method for computation of the discrete Fourier transform over a finite field with reduced 

multiplicative complexity is described. If the number of multiplications is to be minimized, then the 
novel method for the finite field of even extension degree is the best known method of the discrete 
Fourier transform computation. A constructive method of constructing for a cyclic convolution over 
a finite field is introduced. 



Index Terms 



■ Convolution, decoding, discrete Fourier transforms, error correction codes, fast Fourier trans- 

' forms, Galois fields, Reed-Solomon codes. 

(N ■ I. Introduction 

The Reed-Solomon codes are used to correct errors in digital storage and communication 
systems, and for many other applications. The discrete Fourier transform (DFT) over a finite 
field can be applied for encoding and decoding of the Reed-Solomon codes. 

Let us consider a + b (or a x b) to be an addition (a multiplication) only if both summands 
(factors) lie in the original field [5]; that is, operations in the prime field are neglected 
[4]. The main problem in this paper is to reduce the multiplicative (primarily) and additive 
(secondarily) complexity of the DFT computation over a finite field. In previous author's 
publications [9], [18], [10], [11] the new methods of the DFT computation (the cyclotomic 
and recurrent algorithms) are introduced. There are several papers with reduced additive 
complexity of the cyclotomic DFT (for example, [7], [21], [2]). 

In the author's recent paper [13] an idea and an example for a novel method with reduced 
multiplicative complexity of the DFT computation is first proposed. This novel method is 
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based on replacing a cyclic convolution by a multipoint evaluation. For construction of 
a multipoint evaluation we use the novel modification of the Goertzel-Blahut algorithm 
such that the polynomials on divisions levels have a small number of the coefficients in 
GF{2 m )\GF{2) (almost all coefficients are or 1). The algorithm of the DFT computation 
is valid for all composite extension degree m. Without loss of generality, we can assume that 
m is even. 

The paper is organized as follows. In section II, we give basic notions and definitions. 
In section III, we describe well-known algorithms for the DFT computation, the novel 
modification of the Goertzel-Blahut algorithm, and theorem about the multiplicative com- 
plexity coincidence of well-known algorithms. In section IV, we propose and prove the novel 
algorithm for the computation of a multipoint evaluation. In section V, we present the novel 
method for the DFT computation, which is based on replacing a cyclic convolution by a 
multipoint evaluation. Section VI presents examples illustrating the developed techniques. 

II. Basic notions and definitions 

The discrete Fourier transform of length n of a vector / = (/j), i E [0, n— 1], n \ (2 m — 1), 
in the field GF{2 m ) is the vector F = (Fj), 

^■ = E/*«°'» J'e[0,n-1], 

where a is an element of order n in GF(2 m ) (a transform kernel). Let us write the DFT in 
matrix form: 

F = Wf, 

where W = (a' tj ), i,j e [0, n — 1], is a Vandermonde matrix. 

We further assume that the length of the n-point Fourier transform over GF(2 m ) is n = 
2 m - 1. 

Every vector / is associated with a polynomial f(x) = YhZq fi%\ and we have Fj = f(a j ). 
The field of the computation is the finite field GF{2 m ). 
Let q be a primitive element of the field GF(2 m ). 
Definition. The binary conjugacy class of GF(2 m ) is: 

f~Ck „c k 2 „c k 2 2 ^c k 2 m k- 1 \ 

la , a k , a k , . . . , a k I , 
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where a is a primitive element of the field GF(2 m ), a Ck is a generator of the kth binary 
conjugacy class, m k is a cardinality of the kth binary conjugacy class, m k | m. More generally 
the g-ary conjugacy class of GF(2 m ) is: 

(a Ck ,a Ckq ,a Ckq2 ,...,a Ckqrnk ~ 1 ) . 

Let I be the number of binary conjugacy classes of GF(2 m ). 

Definition. The minimal polynomial over GF(q) C GF(2 m ) of /3 E GF(2 m ) is the lowest 
degree monic polynomial M(x) with coefficients from GF(q) such that M(j3) = 0. 
Let M k (x) be the minimal polynomial over GF{2) of the element a Ck . 
Let M k>i (x) be the minimal polynomial over GF(2 m ^ 2 ) of the element a Ck2 \ 
The matrix 
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is called a Vandermonde matrix. 
The matrix 



(«i *) , ij'e [l,t], 
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• a\ 
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a\ ■ 
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q 2 

at ■ 


q 



is called a Moore matrix [17]. 

The transpose of the Vandermonde (Moore) matrix we called the Vandermonde (Moore) 
matrix, too. 

If the matrix is a Vandermonde matrix and a Moore matrix at the same time, then it is 
called a Moore- Vandermonde matrix. 

A basis (3\ (3 2 , . . . , f3 mk ' r ) of GF{2 mk ) over GF{2) is a polynomial basis for the 
subfield GF(2 mk ) c GF(2 m ). 

Let 

/ 2° 2 1 2 2 2 m fe" 1 \ 
[lk , lk , lk , ■ ■ ■ , lk ) 
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be a normal basis for the subfield GF(2 mk ) c GF(2 m ). 

A circulant matrix, or a circulant, is a matrix, each row of which is obtained from the 
preceding row by a left (right) cyclic shift by one position. Let us denote by L k an m k x m k 
circulant the first row of which is a normal basis. We call it a basis circulant [10]: 



/ it 

it 



2 1 
lk 

2 2 
lk 



\lk " lk ••• Tk 
The basis circulant matrix is a Moore matrix for q = 2. 

Definition. A linearized polynomial over GF{2 m ) is a polynomial of the form 

L(x) = Y j a i x 2 \ di G GF(2 m ). 

i 

It can be easily seen that L(a + b) = L(a) + 1/(6) holds for linearized polynomials. 



omr, — 1 \ 

lk 

2° 

7fc 



(lV) , i,j e[0,m k -l]. 



J 



III. The algorithms for the DFT computation 
A. The Goertzel-Blahut algorithm 

We consider the Blahut modification for the DFT computation over finite fields [4] of 
Goertzel's algorithm [15]. 

The first step of the Goertzel-Blahut algorithm is a long division of f(x) by each minimal 
polynomial M k (x): 

f(x) = M k (x)q k {x) + r k (x), 
degr k (x) < degM k (x) = m k , 
k G [0,1-1], 

where r k (x) = V™^ 1 rj :k x j , I is the number of binary conjugacy classes. 

The second step of the Goertzel-Blahut algorithm is to evaluate the remainder at each 
element of the finite field: 

Fi = /(a 1 ) = r fc (a l ) = E^VS,*" ', 
i G [0,n - 1], 

where an element a 1 is a root of the minimal polynomial M k (x). 
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The matrix form for the first step is 



/ 



{rj,o), j e [0,m - 1] 
(r j; i), j e [0, mi - 1] 



= Rf, 



\ (rj,i-i), j e [0,mj_i - 1] J 
where R is an n x n binary matrix. 
The matrix form for the second step is 



((F Cfc2i ),ie [0,m fc -l]) =^((r iifc ),ie [0,m fc -l]), A;e [0,Z-1], 



where V* 



, i, j G [0, m fe — 1], is a Moore- Vandermonde matrix. 



The Goertzel-Blahut algorithm can be written in matrix form [12], [6] as follows: 

ttF = VRf, 



(1) 



where n is an n x n permutation matrix 



V 



v 



\ 



v l 



(2) 



is an n x n block diagonal matrix composed of Moore- Vandermonde matrices V k , k <E 
[0,1-1]. 

B. The cyclotomic algorithm 

The cyclotomic algorithm for the DFT computation [18] is based on representing the 
initial polynomial f(x) as a sum of linearized polynomials (cyclotomic decomposition of the 
polynomial), finding their values in a set of base points (cyclic convolution), and computing 
the resulting vector as a linear combination of these values with coefficients from a prime 
field (multiplication of a binary matrix by a vector). 

The cyclotomic algorithm consists of the following steps: 

0) decomposing an original polynomial into a sum of linearized polynomials 

f( x ) = J2 L k(x); 

1) evaluating linearized polynomials at a set of basis points 

{L k {li)Y, 
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2) components of the Fourier transform are computed as linear combinations of these 
values with coefficients from a prime field 

F.,- = 5>fc(7i), J6[0,n-1]. 

The preliminary step is the cyclotomic decomposition. 

n— 1 

The polynomial f(x) = ^ fax 1 , fa G GF(2 m ), can be decomposed as 

i=0 

2-1 m fc -l 
/W = E^( 2;Cfc )» L k(y) = E /c^^modn/, 

fc=o i=o 
where a Ck is a generator of the kth binary conjugacy class. 

Note, that the term f can be represented as L (x°), where L (y) = f y. 

Consider the derivation of the cyclotomic algorithm. We have 

i-i 

Fj = f(a j ) = ]T L k {a 1Ck ), a Ck G GF(2 m *), m k \ m. 

All elements (a Cfc ) J ' G GF(2 mk ) can be decomposed with respect to some normal basis 
{it, it , it , • • • , tP") of the subfield GF(2^): 

m k — l 

a jCk = E a kjsl 2 k \ a kjs G GF(2). 

Further, 

2-1 (m k -\ \ 2-1 m fc -l 

F i = E M E a fcjs 7f = E E (tT ) • 

fc=0 \ s=0 / fe=0 s=0 

Each of the linearized polynomials can be evaluated at the basis points of the corresponding 
subfield by the formula 

m k — 1 

L k (if) = E lt +P fc k 2^ s G [0, m k — 1], fc G [0, Z - 1]. 

p=0 

Components of the Fourier transform of a polynomial f(x) axe linear combinations of these 
values: 

2—1 rn k — 1 

F 3 = f(a j ) = E E (tT) 

fc=0 s=0 

2-1 m fc -l / m fe -l \ 

= E E a kj« E lk +P fc k 2P , i e [0,71 - 1]. 
fc=0 s=0 \ p=0 / 

In matrix form, the cyclotomic algorithm can be written as 

F — AL(irf), (3) 



December 8, 2011 



DRAFT 



where A is an n x n binary matrix, nf is a permutation of the initial vector /, and L is an 
n x n block diagonal matrix 



L 



\ 



(4) 



v Li -y 

composed of basis circulants L k , k E [0,1 — 1]. 

Multiplication of the block diagonal matrix L by the vector nf reduces to the computation 
of / normalized cyclic convolutions of small lengths m k , k E [0, 1 — 1]. 



C. The relation between the multipoint evaluation and normalized cyclic convolution 
An m-point cyclic convolution is a(x) = b(x)c(x) mod x m — 1. A normalized cyclic 

m— 1 

convolution of length m is a(x) = b(x)c(x) mod x m — 1, if J2 h = 1 [13]. The 

i=0 

computation of a normalized cyclic convolution can be represented as a multiplication by a 
basis circulant matrix L k . 

The multipoint evaluation for the polynomial t(x) and the point set {e^ | j e J}, is a 
computation t(ej), j G J [14, Chapter 10.1]. The computation of a multipoint evaluation for 
the point set {a± : a 2 , a 3 , ... , a t } can be represented as a multiplication by a Vandermonde 
matrix. If the point set is the binary conjugacy class, then the computation of a multipoint 
evaluation can be represented as a multiplication by a Moore- Vandermonde matrix V k . 

Consider the simple construction of a polynomial basis. 

Lemma 1 ([13]): Let (5 be an arbitrary nonzero element in GF(2 m ) whose minimal poly- 
nomial has degree m k . Then (/5°, /3 2 , . . . ,/3 m *~ 1 ) is a polynomial basis for the subfield 
GF(2 m *) c GF(2 m ). 

Proof: The proof is similar to the proof from [16, Chapter 4, Property (M4)]. The proof 
is by reductio ad absurdum. Consider the nonzero polynomial Ya^q 1 a i x \ a i GF(2), 
having j3 as a root. That is, /3 2 , . . . , /3 mk ~ 1 ) are linearly dependent and the minimal 

polynomial of j3 has degree less than m k , a contradiction. Therefore (/5°, /3 2 , . . . ,/3 mfc_1 ) 
are linearly independent. ■ 

Let ((a Ck )°, (a Ck ) 1 , (a Ck ) 2 , . . . , (a Cfe ) mfc_1 ) be a polynomial basis, while 
(7fc°,7fc\7?, • • • ^r^ 1 ) is a normal basis for the subfield GF{2 mk ) C GF(2 m ). 
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Let us construct the basis transformation matrix M k as follows: 



[a 



CfcU 



{a c *f 



Note that the m k x m k matrix M k is binary and nonsingular. 

Lemma 2 ([13]): The relation between the Moore- Vandermonde matrix \4 and the basis 
circulant L k is V k = \ //.. /./,. where M fc is the basis transformation matrix. 

Proof: Let V k = (a Ckj2l ^j, i,j G [0, m k — 1], G [0, Z — 1], be a Moore- Vandermonde 
matrix. L fe = (7 2I+J ), i,j G [0, m fe — 1], /c G [0,/ — 1], is a basis circulant matrix. 

Using (5), we have 



M, 



/ 2° \ 
2 1 
2 2 

7fe 



omt, — 1 

V7l / 



(5) 



/ ( a c fe )020 

(a c *) 12 ° 



(a Cfe ) 



c^02! 



(a Ck ) 



c k \02 m k- 



\ 



lot 



[a Ck f 21 




• (a *) 12 *"* -1 




-1)2 X 


. ( Q ,c^(m fe -l)2-fc- 1 


( it 

it 


2 1 

7fe •• 

2 2 

ll •• 


omr, — 1 \ 

• 7l 

2° 

• 7fc 




V 7f fc " 


2° 

7fe •• 


• ll j 





= M k 



and = M k L k . U 
Lemma 3 ([13]): For mi = m fc , the relation between two Moore- Vandermonde matrices 
is V k = V 1 (M'[)~ 1 M k T , where Mi and M k are the basis transformation matrices. 

Proof: Using V k T = M k L k , we obtain V k = L\Ml = L k M^, L k = V k (M^)-\ For 
mi = m k , we get L\ = L k and 



\4 = L k Ml = L x M T k = V^Mly'Ml 



Thus the multiplicative complexity of a normalized cyclic convolution and a multipoint 
evaluation computation is the same. They differ only by preadditions matrix. 
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D. The relation between the Goertzel-Blahut and cyclotomic algorithms 
We introduce the following Theorem. 

Theorem 1 ([13]): The multiplicative complexity of the Goertzel-Blahut algorithm and 
the cyclotomic algorithm is the same. 

Proof: The block diagonal matrix V (2) from the Goertzel-Blahut algorithm (1) is 
factorized into a product of the block diagonal matrix L (4) from the cyclotomic algorithm 
(3) and a binary block diagonal matrix. 

Using V k = L k M%, (2), and (4), we have 



V 



Ml 



V 



Mf-i) 



This means that the multiplicative complexity of both algorithms is the same. 



E. The modification of the Goertzel-Blahut algorithm 

We introduce the modification of the Goertzel-Blahut algorithm. The algorithm is valid 
for each composite extension degree m. Without loss of generality, we can assume that m 
is even and m > 2. 

The first step is a long division of f(x) by each minimal polynomial M k (x), k G [0, Z — 1], 
over GF(2): 

f(x) = M k (x)q k (x) +r k (x), degr k (x) < degM k (x) = m k , 

k e [0,1-1], 

where r k (x) = ^ fj,kX j , I is the number of binary conjugacy classes. 

The second step for even m, m > 2, is a long division of each r k (x), k e [0, / — 1], by 
each minimal polynomial M kjP (x), p G [0, l k - 1], over GF(2 m/2 ): 

r k (x) = M k , p (x)q Kp {x) + s k>p (x), degs k<p (x) < degM ktP (x), 
ke[0,l-l], P e[0,l k -l], 



where s kp (x) 



degM feiP (z)-l 

Sj jkiP x j , l k is the number of 2 m / 2 -ary conjugacy classes inside 



3=0 



the kth binary conjugacy class. 
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The third step is to evaluate the remainder at each element of the finite field: 

degM kiP (x)-l 

Fi = /(a*) = r^tf) = s fciP (a i ) = s j,k,P alJ > 

3=0 

i e [0,n - l], 

where an element a' 1 is a root of both minimal polynomials M k (x) and M kjP (x). 

The second and third steps of this algorithm can be used for the computation of a multipoint 
evaluation. 

IV. The multipoint evaluation for the binary conjugacy class 

Without loss of generality, we can assume that the binary conjugacy classes of GF(2 m ) 
have cardinality m. The binary conjugacy classes of cardinality m 8 < m are considered for 
the subfield GF(2 mk ) c GF(2 m ). 

Consider the multipoint evaluation for the polynomial t(x) = Y^o Ux l an d the binary 
conjugacy class (a> Ck , ct Cfe2 , a Ck22 , . . . , a Ck2m ^ of GF(2 m ). We compute 

Ti = t(a Ck21 ) , i e [0,m-l]. 
Let us write the multipoint evaluation in matrix form: 



( t (a^°) " 
t(a Ck21 ) 
t (a Ck22 ) 

) 

where 



c fc 2° 


a Ckl2 ° 


a Ck22 ° 


. Q ,Cfc(m-l)2° 


Cfc0 2! 


a Ck121 


a Ck22 ' 


Q ,Cfc(m-l)2 1 


C fc0 2 2 




a Ck22 ' 2 


. a c k (m-l)2' 2 


c k 02™- 1 


a c fc 12—i 


a Ck22m ~ x ■ 


. a c k (m-l)2 m - 1 



( 



J 



\ 



to 

tl 
t 2 

\ t m -l J 



T = V k t, 

T = (Ti), t = (ti), i e [0,m-l], V k = (a Ckj2 ^, i,j G [0,m-l], is a Moore- Vandermonde 
matrix. 

Let us formulate the main result for the computation of a multipoint evaluation. 

Theorem 2: For any finite field GF(2 m ) with even m exists the binary conjugacy class of 
cardinality m, the multipoint evaluation for this class reduces to two multipoint evaluations 
for the subfield GF(2 m ^ 2 ) and m/2 extra multiplications. 

The multipoint evaluation matrix is shown in formula (9), and the number of multiplications 
is shown in formula (10). 

In the rest of this section we consider the proof of Theorem 2. 
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A. The properties of the binary conjugacy class 

Lemma 4: Let a Ck be a generator of the kth binary conjugacy class 

of cardinality m, m is even, a is a primitive element of the 
field GF(2 m ), and GF(2 m ^ 2 )[x] is a polynomial ring (the ring of polynomials over field 
GF(2 m ' 2 )). The polynomials 

(x - a CkT ) (x - a Ck2m/2+l ) G GF(2 m / 2 )[x\, % G [0,m/2 - 1], 

are the different, irreducible, and minimal polynomials over GF(2 m ^ 2 ). 

Proof: Let M k ^x) = (x - a CkT ) (x - a Ck2m/2+l ) 
= x 2 + (a Ck2 * + (V* 21 ) J x + ^ 2, ( 2m/2+1 ) = x 2 + (/3 + /3 2 ) x + <5 2 \ where A = a c * 2 \ 
i G [0,m/2 - 1], 5 = (a 2m/2+l ) Ck G GF(2 m/2 ). 

The element + /3 2m/2 ^j is a root of the polynomial x 2m/2 + x, hence (fii + /3 2m/2 ) G 
GF(2 m/2 ). 

Thus the polynomials M kji (x) G GF(2 m / 2 )[x], i G [0,m/2 - 1], are the minimal and 
irreducible polynomials over GF{2 m / 2 ). 

Using M k (x) = n^o 1 ( x — « Cfc2! ) = IT^cf -1 M k:i (x), we see that all the polynomials 
M kji (x) are different. ■ 

Let us formulate the condition that the minimal polynomial M k:i (x) over GF(2 m l 2 ) has 
only one coefficient in GF(2 m ' 2 )\GF(2). 

Lemma 5: If a Ck = a Ck2m/2 + 1, m is even, for the binary conjugacy class 
(a Cfc , a Cfc2 , a Cfe22 , . . . , a Ck2,m then 

1) a Cfc is a root of the polynomial x 2m/2 + x + 1; 

2) « c * 2! = a c * 2m/2+t + 1 for all % G [0, m/2 — 1]; 

3) the minimal polynomials over GF(2"^ 2 ) are 

M fc>i (x) = (s - c^ 2 *) (x - a Cfc2m/2+l ) = x 2 + x + ^( 2l + 2 '" /2+l ) = x 2 + x + 8 2 \ 

where 5 = (a 2m/2+1 ) Ck G GF(2 m/2 ), i G [0,m/2 - 1]; 

4) the minimal polynomials M kji (x), i G [0,m/2 - 1], over GF(2 m/2 ) are different. The 
elements 5 2 \ % G [0, m/2 — 1] are different, too; 

5) the minimal polynomial over GF(2) is 

m— 1 m/2— 1 m/2— 1 

M k (x) = Y[ (x - a Ck2% ) = 1] M fc)i (a;)= (x 2 + x + 5 2 ') . 

i=0 i=0 i=0 

Proof: The proof is trivial. ■ 



December 8, 2011 



DRAFT 



12 



B. The binary conjugacy class choice 

Theorem 3: For any finite field GF{2 m ) with even m exists the binary conjugacy class 
(a c \ a Ck2 , a Ck2 \ . . . , a * 2 ™' 1 ) such that a c " = a Ck2m/2 + 1. 
To prove this Theorem, we need two Lemmas. 
Lemma 6: For any finite field GF{2 m ) with even m: 

om/2 | om 

X + X + 1 | X + X. 

Proof: The proof is similar to the proof from [3, Theorems 11.34 and 11.35]. Let 

L(x) = x 2m/2 + x, A(x) = L(x) + 1 = x 2m/2 +x + l, and B{x) = (L(x)f m/2 ' 1 + 1. 
We obviously have L(x)B(x) = x 2 ™ + x. 

Let r be a root of the polynomial A(x). Since L(r) = 1 and B(r) = 0, we see that all 
roots of the polynomial A(x) are roots of the polynomial B(x). Hence, A(x) \ B(x) and 
A(x) | x 2 " 1 + x. ■ 

Lemma 7: For any finite field GF(2 m ) with even m exists the minimal polynomial M k (x) 
of degree m over GF(2) such that 

M k (x) \x 2 +x + l. 

Proof: Let Irreducible (i) be a number of irreducible binary polynomials of degree i. 
From [3, Theorem 3.35] it follows that m Irreducible (i) > 2 m — 2 m / 2+1 . The number of the 
finite field GF(2 m ) elements belonging to the binary conjugacy classes of cardinality m is 
m Irreducible (m). The number of the finite field GF{2 m ) elements without the roots of the 
polynomials x 2m/2 + x and x 2 ™ 2 + x + 1 equals 2 m — 2 m / 2+1 . Since there are no elements 
belonging to the binary conjugacy class of cardinality m within the subfield GF(2 m l 2 ), the 
set of the polynomial x 2 " 1 ' 2 + x + 1 roots contains at least one element (and hence the binary 
conjugacy class), which belongs to the binary conjugacy class of cardinality m. The minimal 
polynomial over GF{2) for this binary conjugacy class divides the polynomial x 2 " 1 ' 2 + x + l. 

U 

This completes the proof of Theorem 3. 

The binary conjugacy class choice consists of two steps: 

1. construct the minimal polynomial M k (x) of degree m over GF(2) such that 

M k (x) | x 2m/2 +x + l; 
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2. choose a root a Ck of the minimal polynomial M k (x) as a generator of the A;th binary 
conjugacy class (a Ck , a Ck2 , a Cfc22 , . . . , a Ck2m ^. 

C. Two divisions levels in the multipoint evaluation 

The second and third steps of the modification of the Goertzel-Blahut algorithm includes 
the upper and lower levels of divisions in the multipoint evaluation, respectively. From 
Lemma 5 it follows that the quadratic polynomial M kji (x) has only one coefficient in 
GF(2 m ' 2 )\GF(2), and a division by this polynomial is very simple. 

The upper level of divisions is a long division of polynomial t(x) = X^Lo* ^i x% by eacn 
quadratic polynomial M k:i (x) = x 2 + x + 5 2 \ % e [0, m/2 — 1]: 

t( x ) = (x 2 + x + <5 2 ') qi(x) +Ui(x), ie [0, m/2 - 1], 

where ^(x) = u hl x + ^, , 5 = (a 2m/2+l ) Ck E GF(2 m/2 ). 

The lower level of divisions is evaluating the remainder Ui(x) at pair conjugates a CkT and 
a Ck2m/2+t (with respect to GF{2 m/2 )) for all i e [0,m/2 - 1]: 

D. r/ze matrix form of two divisions levels in the multipoint evaluation 
We decompose the multipoint evaluation matrix into two factors 

^fe Ui ower U U pp er: 

where Ui ower , U upper are nonsingular. 

1 ) The lower level of divisions: Consider the matrix form of the lower level of divisions 
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T 
T 2 



T 



m/2-1 



'-m/2 



T 



m/2+1 
P m/2+2 



- m— 1 



= C// 



lower 




V> m /2-l,0 
\ u m/2-l,l ) 













Ulower 










1 


a Cfe 

















•• 














1 


a Ck2 











•• 




















1 







•• 





























•• 


1 


a c k 2^- 


1 



















•• 














1 


a c * 2W2+1 











•• 




















1 







•• 





























•• 


1 





Im/2 





Im/2 


Im/2 







\ 




a Ck2 




Im/2 


a Ck22 













Im/2 


) 



n. 



V 

where I m / 2 is the (m/2) x (m/2) identity matrix, O is the (m/2) x (m/2) all-zero matrix, 
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the permutation matrix is 



n = 



1 

















•■ 


• 














1 











■ ■ 


• 




















1 





•■ 


• 


























•■ 


• 


1 








1 














•■ 


• 

















1 








■ ■ 


• 























1 


•• 


• 









V 



0000000 •••00 1 

or n= (Uij), i,j e [0,m-l]: 

1, if (j = 2i) and i e [0,m/2 - 1], 

1, if (j = 2i + 1 - m) and % e [m/2, m - 1], (6) 
0, otherwise. 

2) The upper level of divisions: Consider the matrix form of the upper level of divisions 

for m > 2 



lUj = < 




upper 



to 

tl 

\ t m -i J 



U m /2-l,0 
y Mm/2-1,1 J 

Theorem 4: The matrix U upper can be represented as 



U = IT -1 

^ upper 



A 


O 


O 


A 
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( £0 2° 


5 l2 ° 


5 22 ° 


5° 21 




5 221 


£0 2 2 


5 122 


5 22,2 



where m is even, m > 2, the permutation matrix II is defined in formula (6) 

£(m/2-l)2° \_ 
^(m/2-l)2 1 
£(m/2-l)2 2 

^ 02 m/2-l ^ 12 m/2-l ^ 22 m/2-l ^ §(m/2- 1) 2 m / 2 ~ 1 

= (^), i,je[0, m /2-l], 

is a Moore-Vandermonde matrix, 5 = (a 2m/2+1 ) Ck E GF(2 m/2 ), O is the (m/2) x (m/2) 
all-zero matrix, the binary matrix £> is defined in Lemma 10. 
To prove this Theorem, we need several Lemmas. 

Lemma 8: Polynomial r\x + r is the remainder polynomial of the polynomial t(x) = 
YjT=o ti xl > rn > 2, when divided by the quadratic polynomial x 2 + x + e, e G GF(2 m ), in 
characteristic two finite field. Then 



to 
t 2 

\ t m -i J 



where the remainder matrix is 



U 



r 0,0 r 0,l r 0,2 ' ' ' r 0,m-l 



n,o ri,i ri >2 ••• ri, m _i y 
0,<-i + er h *-2, i e [0, 1], i > 2, 



with initial conditions 



r i,o 


= 0, 




= 1, 


»"0,0 


= 1, 


/0,1 


= 0. 



Proof: Let A and // be roots of the polynomial + x + e. 
We have x 2 + x + e = (x — A) (re — ji) = x 2 + (A + /x)x + A// and < 



A + /x = 1 
A/i = e 
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\t(X)=r 1 X + r 

From t(x) — (x + x + e)q(x) + {r x x + r ) it follows that < =>- 

ri = t(X) + = ES 1 UX + o 1 ^ 
r = t(A) + nA = t(\)n + t(/i)A = YT=o l + ^A) 

ri = E™o 1 ^(A l + /i l ) = E™o 1 ^M 

r = t + E™! 1 ^(A- 1 + /i^ 1 ) = E™ o 1 *in>,i 
Hence, we obtain 

(A 4 - 1 + m* _1 )(A + m) - A M (V- 2 + ^- 2 ) = n,i_i + er M _ 2 , if i e [2, ro - 1], 

n,i = A* + /u 4 = I o, if j = 0, 

1, if i = 1 . 

that is, 



Further, we obtain r i 



that is, 



n,i = r M _i + er M _ 2 , i > 2. 

e(A i ~ 1 + /i^ 1 ) = er M _i, if i G [1, m - 1], 
1, if i = 0, 



Using (7) and (8), we get 



n>,< = r o,i-i + er 0)i _2, i > 2. 



(7) 



(8) 



Corollary 1: If m is even and e = 5 = 5 2 ° = (a 2m/2+1 ) * G GF(2 m/2 ), then all elements 
of the remainder matrix 



U = U(0) 



r ,o r 0) i r , 2 



r 0,m-l 



/ 



n,o n,i ri >2 ••• ri, m _i 
belong to the subfield GF{2 m ' 2 ) C GF(2 m ). 

Corollary 2: If m is even and e = 8 2 \ then the remainder matrix is U(i) = [U(0)} T 
[U] 2 \ where matrix [U] j consists of jth degree of all elements of the matrix U. 
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Lemma 9: Let us 5 = (« 2m/2+1 ) Cfc G GF(2 m/2 ), m is even, m > 2, 
then (<5°, <5\ <5 2 , . . . , 5" 1 / 2 ' 1 ) is a polynomial basis for the subfield GF(2 m/2 ) c GF(2 r 



Proof: Consider a (m/2) x (m/2) Moore- Vandermonde matrix A = ^ 2 J , 2, j G 
[0, m/2 — 1]. Since all elements 5 2 ' , iG [0, m/2 — 1], are different (see Lemma 5, Property 4), 
it follows that the Vandermonde matrix A is nonsingular. The Moore matrix A is nonsingular 
if and only if (S°, 5\ 5 2 , . . . , 5 m ' 2 - 1 ) is a basis [16, Chapter 4, Lemma 18]. ■ 
Lemma 10: If the remainder matrix U is defined in Lemma 8 for e = 5 

jj — £/(o) — r °'° r0,1 r °' 2 ' ' ' r °' m_1 

v 7-1,0 n,i n )2 ••• ri, m _i 

m is even, m > 2, (<5°, <5\ <5 2 , . . . , 5™/ 2 - 1 ) is a polynomial basis for the subfield GF(2 m/2 ) 
GF(2 m ), and 5 = (a 2m/2+1 ) Cfc G GF(2 m/2 ), then there is the nonsingular binary matrix B 
such that 



C 



UB 



5° 





5 l 





5 2 







fim/2-1 








5° 





5 1 





5 2 







£m/2-l 



Proof: Let the matrix C/j consist of the first 2(j + 1) columns of the matrix U 

\ 



r ,o 


r o,i 


^0,2 • • • 


r 0,2j 


r 0,2j+l 


r i,o 


r i,i 


r-1,2 ■ ■ ■ 


n,2j 


r i,2j+i 



We use mathematical induction on j. 

1. The basis. For j = 1 the matrix U\ is C/i 



1 


5 5 


1 


1 5+1 



J 



2. The inductive step. For j there are next properties: 

a) for each of the elements r 0i2 j, r lj2 j, ?"o,2j+i, 7"i,2j+i there are binary coefficients 
i G [0,j], s G [1,4], such that 

al) r , 2 j = + Ei=o 

a 2) r 0,2j+l — J2i=0 &i,2$ l , 

a3) ri j2 j = Ei=o a i,3<5\ 

a4) n^j+i = 5 j + ECo °i,4^; 

b) there exists the nonsingular binary matrix Bj such that 



5° 





5 1 





<5 2 







5 j 








5° 





5 1 





5 2 







5i 



For j + 1 there are: 
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?~0,0 


r o,i 


r ,2 ■ ■ ■ 


r o, 2j 


r 0,2j+l 


r 0,2j+2 


r 0,2j+3 


r i,o 


n,i 


r 12 . . . 


T \2j 


r i,2j+i 


r l,2j+2 


r l,2j+3 / 



the matrix C/j+i = 

c) for each of the elements ^,2^+2,^0,2^+3,^1,2^+2,^1,2^+3 there are binary coefficients b iyS , 
i e [0,j + 1], s e [1,4], such that 

cl) using Lemma 8 and Properties (al,a2), we get r 0t2j+2 = r , 2 j+i + $ r o,2j = Ei=o°i,2^ + 

5 + Eto oi,!**) = + Di=0 

c2) using Lemma 8 and Properties (a2,cl), we get r ,2j+3 = r o,2j+2 + ^o,2j+i = 

Eto + S n=o Oirf* = £i=o &i,2<*\ 

c3) using Lemma 8 and Properties (a3,a4), we get ri :2j+2 = r i,2j+i + ^i,2j = & + 

EU (HaP + S EU HsS 1 = ELo fti.s**, 

c4) using Lemma 8 and Properties (a4,c3), we get r h2j+3 = n,2j+2+^i,2j+i = ELo &i,3<^+ 

5 + Eto = + ELo M'; 

d) there exists the nonsingular binary matrix Bj +1 such that 



U j+1 B j+1 



' 5° 





5 l 





5 2 







5i 





#+1 








5° 





5 l 





5 2 







& 





5 3+i 



Proof of Theorem 4: The remainder matrix U (i) is calculated in Lemma 8 for a long 
division of the polynomial t(x) = E^o 1 tjX 3 , m > 2, by the quadratic polynomial x 2 + x + 
5 2 \ i G [0,m/2 - 1]. 

From Corollary 2 it follows that U(i) = [U(0)f = [Uf . Then 





( u(o) ^ 




( U \ 




C/(l) 




[U] 2 


U U pper 


C/(2) 




[Uf 




( C/(m/2 - 1) J 







Using Lemma 10, we have 



U upper B 
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' 5° 

o 




6° 


s 1 
o 




i 1 


,5 2 

o 




gm/2-1 

o 




g ™/2-l 







s 021 


J" 1 







J 22 ' 











gl, m /2-l)2 1 







,0 2 2 








J 22 " 



























j02™/2-1 

> o 




s02m /2-l 


4l2 m /a -l 





4 12 / 


J2 2™/ 2 ' 1 





(5 (m/2-l)2'"/ 2 - 1 





s (m/2-l-)2™/ 2 - 1 



Finally, we obtain 



= n- 1 










A 



n. 



u = n -1 

^ upper 



' A 








A 



TIB- 1 . 



3) The multipoint evaluation matrix: The multipoint evaluation matrix for even m, m > 2, 



is 



upper 







( 


a c " 2 






1 m I '2 


N 


Im/2 









Im/2 


I m /2 J 




,m/2-l 


ft 


A 






V o 


Im/2 J 







UB-\ (9) 



where J m / 2 is the (m/2) x (m/2) identity matrix, O is the (m/2) x (m/2) all-zero matrix, 
the permutation matrix n is defined in formula (6), A = ((P 2 *), i,j G [0,m/2 — 1], is a 
Moore- Vandermonde matrix, 5 = (a 2m/2+i y k E GF(2 m/2 ), the binary matrix B is defined 
in Lemma 10. 



E. The complexity of the multipoint evaluation 

The recursive formula for the number of multiplications of the multipoint evaluation is 

Mult(m) = 2 Mult(m/2) + m/2, (10) 
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TABLE I 

The complexity of the multipoint evaluation 





old methods 


novel method 


m 


multiplications 


multiplications 


1 








2 


1 


1 


3 


3 


— 


4 


5 


4 


5 


9 


— 


6 


10 


9 


7 


12 




8 


19 


12 


9 


18 




10 


28 


23 


11 


42 




12 


32 


24 



with initial condition Mult(l) = 0. 

This recursion is satisfied by Mult(m) = |mlog 2 m for m — 2\ % > 0. 

Let us remember (see Lemma 2) that the multiplicative complexity of a normalized cyclic 
convolution and a multipoint evaluation computation is the same. The complexity of the 
multipoint evaluation for old methods [4], [21, Table 1], and for even m of the novel method 
is shown in Table I. 



F. Examples 

1) m — 2: Let Cfc=l, while the binary conjugacy class is (a 1 , a 2 ) of GF(2 2 ). 
There is the lower level of divisions only. 



V 1 



1 a 
1 a 2 



i \ 



1 
1 1 



1 a 
1 
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2) m = 4: The finite field C7F(2 4 ) is defined by an element a, which is a root of the 
primitive polynomial x 4 + x+l. Let c k =l, while the binary conjugacy class is (a 1 , a 2 , a 4 , a 8 ) 
of GF{2 4 ). 



x 



/ 


1 


a 1 


a 2 


a 3 


\ 








/ 


1 











\ 




/ 


1 





a 


\ 




l 


2 

a 


4 

a 


6 

cc 















1 

















1 





a 2 




l 


a A 


a s 


a 12 












1 





1 

















1 





V 


1 


a s 


a 1 


a 9 


J 








V 





1 





1 


/ 




V 













1 ) 


/ 


1 


a 5 








\ 




( 


1 











\ 




/ 


1 











\ 






1 


a 10 




















1 














1 


1 


1 














1 


a 5 











1 




















1 


1 






V 








1 


a 10 


J 




\ 











1 


/ 




I 











1 


/ 





Using 5 = a 5 , we have 



A = 



1 1 of \ 


(i 




< i 


:) 


/ 


l 


:) 


\ 1 q,i ° ) 






{ 1 




V 








V. The novel method for the DFT computation 

Consider several constructions of the novel method on the basis of different algorithms. 

The binary conjugacy classes of cardinality mi < m are considered for the subfield 
GF{2 mi ) C GF(2 m ). If rrii is even then we can apply formula (9) for multipoint evaluation 
matrix construction. In other cases we can use the classic methods (for example, [4]) for the 
multipoint evaluation or normalized cyclic convolution computation. We further assume that 
a cardinality of the binary conjugacy classes is m. Let J m be the indices set of the binary 
conjugacy classes of cardinality m. 

According to Theorem 3, we can choose the A;th binary conjugacy class for which exists 
the multipoint evaluation matrix V k (9). 

A. The novel method based on the Goertzel-Blahut algorithm 

Consider the construction of the novel method based on the Goertzel-Blahut algorithm 
(formulae (1) and (2)) for even m. From Lemma 3 it follows that for all binary conjugacy 
classes of cardinality m, we have 

Vj = V k {Ml)- l Mj, 
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where j e J m , Mj and M fc are the basis transformation matrices. 
Using (2), we have 



V 



V 



\ 



V l 



Vi-x 



v k 



v k 



\ 



■J 



\ 



DP, 



7 



where J m = {ji, j 2 , . . .}, D is the block diagonal matrix of the multipoint evaluation matrices, 
and P is the binary block diagonal matrix of combined preadditions. 
In matrix form, the DFT algorithm can be written as 



nF = VRf = D(PR)f. 

where it is the permutation matrix, D is the block diagonal matrix of the multipoint evaluation 
matrices, P is the binary block diagonal matrix of combined preadditions, R is the binary 
matrix. 

The algorithm of length n = 2 m — 1 over GF(2 rn ) takes two steps: 

1) The first step is multiplying the binary matrix PR by the vector / over GF(2 m ); 

2) The second step is calculation of / m-point multipoint evaluations. 

B. The novel method based on the cyclotomic algorithm 

Consider the construction of the novel method based on the cyclotomic algorithm (formulae 
(3) and (4)) for even m. From Lemma 2 it follows that for all binary conjugacy classes of 
cardinality m, we have 

Lj = L k = V k {Ml)-\ 

where j E J m , M k is the basis transformation matrix. 
Using (4), we have 



L 



\ 
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v k 



V 



7 



(MJ)- 1 



V 



DPr 



7 



24 



(11) 



where D is the block diagonal matrix of the multipoint evaluation matrices, and P c is the 
binary block diagonal matrix of combined preadditions. 
In matrix form, the DFT algorithm can be written as 



F = W(irf) = AL(nf) = ADP c (nf), 

where W = (a lj ), i,j G [0, n — 1], is a Vandermonde matrix, A is the binary matrix, D 
is the block diagonal matrix of the multipoint evaluation matrices, P c is the binary block 
diagonal matrix of combined preadditions, n is the permutation matrix. 

We see that the algorithm contains two multiplications of the binary matrix by the vector. 
To reduce the complexity, we consider a modification of this algorithm. 

1 ) The novel method based on the inverse cyclotomic algorithm: Let us write the inverse 
DFT over the field GF(2 m ) in matrix form: 

F = W- l (7T l f), 

where W = (a lj ), i,j G [0,n — 1], is a Vandermonde matrix, ni is the permutation matrix. 
The matrix L^ 1 is a block diagonal matrix composed of basis circulants [8]. 
Similarly (11), we have 

L- 1 = DP h 

where D is the block diagonal matrix of the multipoint evaluation matrices, and Pj is the 
binary block diagonal matrix of combined preadditions. 
Finally, we obtain 

F = W-\mf) = (AL)- W) = L^A-^mf) = D(P i A~ 1 )('Kif) : 

where D is the block diagonal matrix of the multipoint evaluation matrices, Pi is the binary 
block diagonal matrix of combined preadditions, A is the binary matrix, ni is the permutation 
matrix. 

The algorithm consists of the multiplication of the binary matrix by the vector and calcu- 
lation of / m-point multipoint evaluations. 
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2) The novel method based on the recurrent algorithm: The recurrent algorithm [10] 
is a modification of the cyclotomic algorithm. This algorithm has a regular structure of 
the transform matrix A r L. The multiplicative complexity of the cyclotomic and recurrent 
algorithms is the same. 

Substituting (11) into matrix form of the recurrent algorithm, we have 

n r F = A r L(n r f) = L T A T r {<K r f) = LA T r {n r f) = D{P c A T r )^rf), 

where D is the block diagonal matrix of the multipoint evaluation matrices, P c is the binary 
block diagonal matrix of combined preadditions, A r is a binary matrix with a regular structure, 
7r r is the permutation matrix. 

The algorithm consists of the multiplication of the binary matrix by the vector and calcu- 
lation of I m-point multipoint evaluations. 

C. The complexity of the DFT computation for even extension degree finite field 

It is well-known that the number of the binary conjugacy class of cardinality % and 
irreducible binary polynomials of degree % is the same [16]. The number of irreducible 
binary polynomials of degree % is [16, Chapter 4, Theorem 15] 

Irreducible (i) = - £ fi(d)2 l/d , 
1 d\i 

where /j,(d) is the Mobius function. 

The number of multiplications of the novel method for the (2 m — 1) -point DFT computation 

is 

Mult(i) Irreducible (i), 

i\m 

where Mult(m) is the number of multiplications of the multipoint evaluation (10). 

The complexity of the n-point DFT computation for even m of the cyclotomic algorithm 
and novel method is shown in Table II. The number of multiplications for the cyclotomic 
algorithm for lengths 15, 63, and 255 is cited from the original paper [18], for lengths 1023 
and 4095 it is calculated in the paper [21, Table 4]. 

For the multiplication of the binary matrix by the vector, we can use the modified "four 
Russians" algorithm (V. L. Arlazarov, E. A. Dinits, M. A. Kronrod, and I. A. Faradzhev)[l, 
Algorithm 6.2] for multiplication of Boolean matrices, with complexity less than 2n 2 / log 2 n 



December 8, 201 1 



DRAFT 



26 



TABLE II 

The complexity of the ti-point DFT computation 





cyclotomic algorithm 


novel method 


n 


multiplications 


multiplications 


15 


16 


13 


63 


97 


88 


255 


586 


373 


1023 


2827 


2332 


4095 


10832 


8140 



additions over elements of GF(2 m ). On the other hand, we can use the heuristic algorithms 
(for example, [7], [20], [19]), whose complexity we could not estimate. Note that since the 
matrix of combined preadditions is multiplied by a binary matrix, it follows that the algorithm 
complexity does not depend on preadditions. 

The asymptotic complexity of the calculation of I m-point multipoint evaluations is 

0(lm log 2 m) = O (^—m log 2 m^j = 0(n log 2 log 2 n) 

multiplications and additions over elements of GF(2 m ). 

VI. Examples 

Consider the DFT of length n = 15 over the field C7F(2 4 ). The finite field C7F(2 4 ) is 
defined by an element a, which is a root of the primitive polynomial a; 4 + x + 1. Let us take 
the primitive element a as a transform kernel. The binary conjugacy classes of GF(2 m ) are: 
(a ), (a\a 2 ,a\a 8 ), (a 3 , a 6 , a 12 , a 9 ), (a 7 ,a 1 \a 13 ,a 11 ), (a 5 , a 10 ). 

A. The Goertzel-Blahut algorithm 

The first step of the Goertzel-Blahut algorithm is 
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f / \ 

f(x) = 




(x + l)qo{x) +r {x), r (x) = 




r ,o 


/(*) = 




(x 4 + x + l)qi(x) +ri(x), r\(x) = 


r 3j ix 3 


+ r 2) \x 2 + r hl x + r ,i 


m = 


(x 4 + X 3 


+ x 2 + x + l)q 2 (x) +r 2 (x), r 2 (x) = 


r 3j2 x 3 


+ r 2i2 x 2 + r 1;2 x + r , 2 


m = 




(x 4 + x 3 + l)q 3 (x) +r 3 (x), r 3 (x) = 


r 3i3 x 3 


+ r 2<3 x 2 + r 1>3 x + r , 3 


m = 




(x 2 + x + l)q<i(x) +r 4 (x), r 4 (x) = 




r M x + r , 4 


where 


?~0,0 


= Sj=0 fa 








r o,i 


= fa + fa + fa + fa + flO + fl2 + /l3 


+ /l4 






n,i 


= /i + fa + fa + h + fa + ho + fn - 


f fu 





?~2,1 = A + fa + /e + /s + /io + /ll + /l2 + /l3 

^3,1 = /3 + / 6 + / 7 + fa + Al + /l2 + /l3 + /l4 

?~0,2 = /o + /4 + /5 + fa + flO + fu 

ri, 2 = fi + fa + fa + fa + /n + /i4 

^2,2 = f2 + fa + fl + fa + fl2 + fu 

r 3 ,2 = fa + fa + fa + fa + fl3 + fu 



ro,3 


= fo + fa + fa + fa + fa + fa - 


+" /ll " 


f /l2 


r l,3 


= fl + fa + fa + fa + fa + flO 


+ /12 


+ /13 


7*2,3 


= fa + fa + fa + fa + fa + fax 


+ /13 


+ /14 


^3,3 


= fa + fa + fa + fa + fa + Ao 


+ /ll 


+ /14 



Toa = fa I fa I fa I fa I fa I fa I fa I /n + /12 + /i4 

r M = fa + fa + fa + fa + fa + fa + flO + /ll + /l3 + /l4- 

The second step of the Goertzel-Blahut algorithm is 
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^0 


= f(oP) 


= r 0,0 






Fi 


= f(^) 


= n(a 1 ) 





r 3j ia 3 + r 2A a 2 + r^a 1 + r ,i 


F 2 


= iV) 


= ri(a 2 ) 





r 3j i« 6 + r 2i i« 4 + r ltl a 2 + r 0j i 


F 4 


= /(« 4 ) 


= r x (a 4 ) 


_ 


^3,ia 12 + r 2 ,ia 8 + r ltl a 4 + r ,i 


F 8 


= /(a 8 ) 


= ri(a 8 ) 





^3,i a9 + ^.ltt 1 + ^i,i« 8 + ^0,1 


F 3 


= /(a 3 ) 


= r 2 (a 3 ) 





^3,2« 9 + ^2,2« 6 + r h2 a 3 + r , 2 

7 ? 7 7 


F 6 


= /(« 6 ) 


= r 2 (a 6 ) 





^3,2tt 3 + ^2,2« 12 + ^l,2a 6 + ^0,2 


F\2 


= /(a 12 ) 


= r 2 (a 12 ) 





r 3 2 a 6 + r 2 2 a 9 + n 2 a 12 + r , 2 

I 7 7 7 


Fg 


= /(a 9 ) 


= r 2 (o; 9 ) 





^3,2« 12 + ^2,2« 3 + ^1,2« 9 + ^0,2 


F 7 


= /(a 7 ) 


= r 3 (a 7 ) 





r 3i3 a 6 + r 2j3 a 14 + ri i3 a 7 + r , 3 


Fu 


= f(« 14 ) 


= r 3 (a 14 ) 




r-i %cx 12 + ro -?a; 13 + r\ -?a 14 + rn s 


F13 


= /(« 13 ) 


= r 3 (a 13 ) 




^3,3" 9 + r 2 , 3 a n + ri i3 a 13 + r , 3 


F n 


= /(a 11 ) 


= r 3 (a n ) 




^3,3a 3 + ^2,3a 7 + r li3 a u + r , 3 


F 5 


= Z(« 5 ) 


= r 4 (a 5 ) 




r M a 5 + r , 4 


Fio 


= /(« 10 ) 


= r 4 (« 10 ) 




r M a 10 + r , 4 
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The Goertzel-Blahut algorithm in matrix form is 

7iF = VRf 



or 
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B. The novel method based on the Goertzel-Blahut algorithm 

In the field GF(2 4 ), the normal basis is (7 1 , 7 2 , 7 4 , 7 s ) = (a 6 , a 12 , a 9 , a 3 ); in its subfields, 
the normal bases are (a 5 , a 10 ) for GF{2 2 ) C GF(2 4 ), and (a ) for GF(2) C GF(2 4 ). 
Let us denote the basis circulants by 



L\ — L 2 



7 2 7 4 7 S 



r 



ry^ ^8 ^2 

\7 8 7 1 7 2 7 4 / 



a 5 a lcA 
a 10 a 5 



Using V k = L k M{, , we have the basis transformation matrices M k , k e [0,4]: 



1 1 x 
10 
110 

1111 



(M 2 ) 
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^ 1 1 ^ 
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10 1 

110 
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1 



1110 
10 10 
110 

Let us remember that the multipoint evaluation matrix for the binary conjugacy class 

(a 1 , a 2 , a 4 , a 8 ) is: 
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a a 5 



A Q 
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111 
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10 
111 
11 
1 

\ 



= SP 1 , 



where Pi is the matrix of preadditions. 
From Lemma 3 it follows that 

V = 
V 1 = 



1 

SP 1 



V 2 = V^Mly'M? = SP^Mfy'MT = SP 2 
V 3 = V^Mfy'M? = 1 SPi(M 1 T )- 1 M 3 T = SP 3 

where Pk, k £ [1,3], is the matrix of preadditions, and M k , k e [1,3], is the basis transfor- 
mation matrix. 

The multiplication of the 2 x 2 Moore- Vandermonde matrix V 4 by the vector is very simple: 







( 1 a 5 ^ 




r 0,4 | 


\FmJ 




I 1 « 10 ) 
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= v 4 

Using (2), we obtain 




V 



1 

1 1 



V = 



v 1 



1 



Vo 



V, 



V 




V 



v, 



V 



V 



Pi 



= DP, 



where I 2 is the 2x2 identity matrix, D is the block diagonal matrix of the multipoint 
evaluation matrices without preadditions, and P is the binary block diagonal matrix of 
combined preadditions. 

In matrix form, the DFT algorithm can be written as 

Ti F = VRf = D(PR)f 



or 



Fi 
F 2 

F* 



F 3 
F e 

F\2 

F 9 



F 7 
F u 

-Pi 3 

F u 



F 5 
\F W J 



( a 



a a 10 



a a 5 



a a 10 



a 1 a 6 



a 2 a 12 



a 



a" a" 



a a 5 



a a 10 



a a 5 



a a 10 



a 1 a 6 



a 2 a 12 



4 9 

or or 



or a" 



a a 5 



a a 10 



a a 10 



a 1 a 6 



a 2 a 12 



4 9 

a cr 



a" a" 



a a 5 



a a 10 



December 8, 2011 



DRAFT 



33 



1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 






ffo) 


1 











1 








1 


1 





1 





1 


1 


1 






h 








1 


1 





1 





1 


1 


1 


1 











1 






h 





1 


1 


1 


1 











1 








1 


1 





1 






h 











1 








1 


1 





1 





1 


1 


1 


1 






h 


1 











1 


1 











1 


1 











1 






h 





1 





1 








1 





1 








1 





1 









h 





1 








1 





1 








1 





1 








1 






h 





1 


1 


1 


1 





1 


1 


1 


1 





1 


1 


1 


1 






h 


1 


1 


1 





1 





1 


1 








1 











1 






h 





1 


1 








1 











1 


1 


1 


1 





1 






ho 








1 











1 


1 


1 


1 





1 





1 


1 






hi 





1 


1 


1 


1 





1 





1 


1 








1 












/l2 


1 





1 


1 





1 


1 





1 


1 





1 


1 





1 






/l3 





1 


1 





1 


1 





1 


1 





1 


1 





1 


1 


/ 




vW 



C. The complexity of the 15 -point DFT computation 

The novel method of the 15-point DFT computation based on the Goertzel-Blahut algo- 
rithm consists of three parts: 

1) triple multiplication by the matrix S (by the Moore- Vandermonde multipoint evaluation 
matrix V\ = SP\ without preadditions): 3 x 4 = 12 multiplications and 3 x 8 = 24 
additions; 

2) multiplication by the matrix V4. 1 multiplication and 2 additions; 

3) multiplication of the binary matrix PR by the vector / (using a heuristic algorithm 
[20]): 44 additions. 

The complexity of this method is 13 multiplications and 70 additions. 

The complexity of some methods of the 15-point DFT computation is shown in Table III. 

VII. Conclusion 

A novel method for computation of the DFT over a finite field with reduced multiplicative 
complexity is described. The method is applicable for any DFT length but reducing the 
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TABLE III 

The complexity of the 15-point DFT computation 



method 


multipli- 
cations 


addi- 
tions 


the cyclotomic algorithm [18] 


16 


77 


the cyclotomic algorithm using 

an improved heuristic algorithm [19] 


16 


76 


the recurrent algorithm [10] 


16 


91 


the cyclotomic algorithm with common 
subexpression elimination algorithm [7] 


16 


74 


the novel method based on 
the Goertzel-Blahut algorithm 


13 


70 


the novel method based on 
the recurrent algorithm 


13 


68 



multiplicative complexity is achieved not for all lengths. For the DFT computation of length 
n = 2 m — 1 in the field GF(2 m ) for even m the novel method is the best known method (if 
the number of multiplications is to be minimized), and the exact formula for the number of 
multiplications is analytically obtained. 
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